Picture for Junnan Zhu

Junnan Zhu

National Laboratory of Pattern Recognition, Institute of Automation, CAS, Beijing, China, School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China

ReTool-Video: Recursive Tool-Using Video Agents with Meta-Augmented Tool Grounding

Add code
May 13, 2026
Viaarxiv icon

TRACER: Verifiable Generative Provenance for Multimodal Tool-Using Agents

Add code
May 11, 2026
Viaarxiv icon

Parser-Oriented Structural Refinement for a Stable Layout Interface in Document Parsing

Add code
Apr 03, 2026
Viaarxiv icon

The Trinity of Consistency as a Defining Principle for General World Models

Add code
Feb 26, 2026
Viaarxiv icon

MentalSeek-Dx: Towards Progressive Hypothetico-Deductive Reasoning for Real-world Psychiatric Diagnosis

Add code
Feb 03, 2026
Viaarxiv icon

FocalOrder: Focal Preference Optimization for Reading Order Detection

Add code
Jan 12, 2026
Viaarxiv icon

BayesRAG: Probabilistic Mutual Evidence Corroboration for Multimodal Retrieval-Augmented Generation

Add code
Jan 12, 2026
Viaarxiv icon

PARL: Position-Aware Relation Learning Network for Document Layout Analysis

Add code
Jan 12, 2026
Viaarxiv icon

GenProve: Learning to Generate Text with Fine-Grained Provenance

Add code
Jan 08, 2026
Viaarxiv icon

Context-Adaptive Synthesis and Compression for Enhanced Retrieval-Augmented Generation in Complex Domains

Add code
Aug 26, 2025
Figure 1 for Context-Adaptive Synthesis and Compression for Enhanced Retrieval-Augmented Generation in Complex Domains
Figure 2 for Context-Adaptive Synthesis and Compression for Enhanced Retrieval-Augmented Generation in Complex Domains
Figure 3 for Context-Adaptive Synthesis and Compression for Enhanced Retrieval-Augmented Generation in Complex Domains
Figure 4 for Context-Adaptive Synthesis and Compression for Enhanced Retrieval-Augmented Generation in Complex Domains
Viaarxiv icon